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Dedicated to Yu. V. Prokhorov 
on the occasion of his 80-th birthday 

There are many unusual limit theorems in Number Theory which are well-known to experts 
in the field but not so well-known to probabilists. The purpose of this paper is to discuss some 
examples of such theorems. They were chosen in order to be close to the field of interest of Yu. V. 
Prokhorov. 

One of the main objects in Number Theory is the so-called Mobius function. It is defined as 
follows 

{1 if n=l; 

if n is not square- free; 

(— l) fc if n is the product of k distinct primes. 

Throughout the paper, when we write n = pip 2 ■ ■ - pk we assume that p\ < p 2 < ■ ■ ■ < Pk are the 
first k prime numbers. Many properties of the Mobius function are connected with the Riemann 
zeta function. For example, while the Prime Number Theorem is equivalent to the fact that 



5>(n) = o(J\0, 



n<N 



the Riemann Hypothesis is equivalent to 



n<N 



for every e > 0. 

Recently, a conjecture by Sarnak [13] has fostered a great interest towards the connections 
between the Mobius function and Ergodic Theory, and in particular the works of Furstenberg [7] 
and Green and Tao [9]. 
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1 A probabilistic model for square-free numbers 

Fix m > 1 and introduce the set fl m , whose elements have the form n = YYjLiPj 3 ? where 
Vj G {0, 1}. Then /i(n) = ±1 iff n £ Q m for some m. Define on Q m the probability distribution 
U m for which 

In (JTJ Z m is the normalizing factor and 

Z m = £ ff^ = nf 1 + -) = explain fl + i)j=exp(o(l) + f;i 

as m — > oo. Denote by iV(t) the number of primes which are less or equal than t. The Prime 
Number Theorem says that N(t) ~ ^ as t — > oo and a slightly stronger version asserts that 

N(t)-A = o(A~)- (2) 



hit \\n 2 t 

We can write, by summation by parts, 



in 



Pm 

i.e. Z m ~ 0(l)lnp m . A more precise asymptotic follows from Mertens' product formula [TT 



p<n 

where 7 is Euler-Mascheroni constant. In fact 



lim InnTT ( 1 - - J = e~ 7 « 0.561459, 
V P/ 



C(2) _1 

as n 00. 



Thus 

e 7 

Zm ~ C(2) lnPm ' ^ 
By analogy with Statistical Physics, Z m is called partition function. 

It is easy to check that w.r.t. IT m , the random variables Vj are independent and 

= 0} = n m {^- = 1} = — |— , 1 < j < m. 

i + Pj 1 + Pi 



Indeed, 



Z Z— ' FT 7 1 n u l TT' m 

™ yi,...,^_il/ i+ i,...,i/ m Hi=l ^ llz=j + l^ 

in) ni=j+i 



Since 



rip? = s E^ in ^ r ' 

i=i I i=i 



the statistical properties of n w.r.t. II m are determined by the properties of Y^j=i u j ^Pji which 
are sums of independent random variables. However the Central Limit Theorem cannot be applied 
here because Vj are not identically distributed. Instead, the following limit theorem is valid. 

Theorem 1.1. Let ( m = Y^jLi v i ^ n Pj- As m — > oo the distributions of Cm converge weakly 
to the infinitely divisible distribution whose characteristic function <p(\) has the form 



<p(\) = ex P { / dv } . (4) 




Proof. The characteristic function ip m of Q m is 

{ A m 1 m ( 1 Al 

^ e «, in B j = n (t^- + ttf/ 1 ^ 

iX In pj 
g lnpm — 1 



{Pm , 
J2(N(t)-N(t-l))]n[l 



1+t 

exp <j f m {p m + l)iV(p m ) - ^ N(t - l){f m {t + 1) - / m (t)) 



Pm 



t=l 



by summation by parts, where f m (s) = hi(l + ^ (e^p™ — lJJ. Since / m is complex-valued, 

the identity / m (t + 1) — J m (t) = f m (t + t) for some < r < 1 does not follow from the 
mean value theorem and we have to work with the real and imaginary parts separately. Writing 
fm = K/m + i^fm we have 



and (by choosing the principal branch of the natural logarithm) 



S/m(s) = arg 1 



l + s 



g In Pm 



sin 



arctan 



A In s 
lnp m 



COS 



A In s 
lnp m 



Now, by applying the mean value theorem twice to dtf m and $sf m separately, we get 

Uf m {t + 1) - Mf m (t) = (E/ m )'(t + n) = (H/ m )'(t) + n^W'it + t[) 

for some < r[ < T\ < 1, and 

£sf m (t + 1) - 5s(f m )(t) = (9/ m )'(t + r 2 ) = m m )'(t) + T 2 (5sf m )"(t + t' 2 ) 
for some < r 2 < r 2 < 1. Thus 

Pm 

In ^ m (A) = f m (p m + l)N(p m ) - £ JV(t - 1) (.C(t) + 7i(3fi/ m )"(t + r{) + r 2 (3/ m )"(t + r^)) . (5) 

We claim that the sum involving gives the main term. In fact, the first term and the 

other sums in (jHJ) tend to zero as m — > oo (see Appendix). Thus, the main term comes from the 
following sum: 

-g^- 1)&M --gfe + (gi)) . + A (^-i 



Pm / 1 
( Tw 



t=i 



tint 



g lnpm — X 



o 



t\n 2 t 



/ zA In t 
j g In p 7 n 



(2* 



t(t 

In t 
g In Pm 



1) 
1 



-g In Pm 

\np n 



g In Pn 



t + g ln Pm 



(t+l) S 



zA t 
\np m t + 1 



g lnpm 



(6) 



By opening the brackets in ([H]) we obtain twelve sums. Let us look at the first sum and consider 
the change of variables (which will be used in the Appendix too) v = v(t) 



hit 
hip,, 



for which 



dt> = v(t + 1) - v(t) = v'(t + r 3 ) = v'(t) + r 3 v"(t + Tg) for some < 7g < r 3 < 1. We get 



t=2 v v 



T 



(t + T>) 2 \np r 



- 1 



-dv 



as m — > oo since for some C > 

Pm 

£ 



iAu 



i=2 



(t + r') 2 ln^ 



< 



C|A| £ 



+2 



0. 



All the remaining eleven sums coming from fl6]) tend to zero (see Appendix) and this concludes 
the proof of Theorem 11.11 □ 



4 



Notice that 
cos(Aw) — 1 



dv 



cos u 



du — In v and lim 

U x^0+ 



COS u 

u 



du — In a; = 7, 



|A|« 



where 7 is the Euler-Mascheroni constant as before. Therefore the improper integral J Q J — dv 
converges to —7 — ^^du — In |A|. On the other hand 

\X\v 1 |A| 

sin(Ai>) f sinu f sin(Xv) [ smu 
dv = sgn(A) / du gives / dv = sgn(A) / du. 
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u 



This shows that 



cos tt 1 / sinu 
exp < — I 7 + / du + In A I + 1 / du 
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<pM = { 1 



COS 11 

exp < — I 7 + / du + ln(— A) 



A > , 

u 



A=0 , 



smu, 
I du > A < 0. 

u 



-X 



It is known (see pQ) that <p(\) is the characteristic function of the Dickman-De Bruijn distri- 
bution, with density e _7 p(t), where p(t) is determined by the initial condition 



P(t) 



0, t<0; 

1, < t < 1, 



(7) 



and the integral equation 



tp(t) = / p(s)ds, t e 



t-i 



It also satisfies the delay differential equation 

tp\t) +p(t-l) = 

for t > 1 (at t = 1 we consider the right derivative) and for every k = 1,2,3, .. . there is an 
analytic function Pk(t) that gives p(t) on k — 1 < t < k. For example, p\ = 1, = 1 — lnt and 
p 3 (t) = 1 — hit + J 2 * ln(tt — 1) — . It is also easy to see that p G C k ([k, 00)) for each k. 
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Among other properties of p(t) one can mention that it is log-concave on [1, oo) and 



pit) = exp { -t In t + In In t - 1 + hO 



hit V (M 2 

as t — > oo. In other words, the limiting density e~ 7 p(t) is constant on the interval (0, 1], where it 
takes the value e -7 , and decays faster then exponentially on (1, oo), like Poisson distribution. In 
particular, all its moments exist. 

The Dickman-De Bruijn density p first appeared in the theory of smooth numbers (i.e. num- 
bers with small prime factors). Let ^(x,y) denote the number of integers < x whose prime 
factors are < y. Dickman [3] showed that x 1//u ) ~ xp[u) as x — > oo. The range of y such 
that the asymptotic formula ^(x, y) ~ xp(u), where x = y u , has been significantly enlarged by De 
Bruijn [HI21[3] (y > exp((lnx) 5/8+£ )) and Hildebrand (y > exp ((In In x) 5/3+£ )). Notice that in 
our ensemble Q m (where each element is weighted, not simply counted) we have x = P1P2 • • -Pm 
and y = p m and thus y ~ lax. In this regime Erdos [6] showed that \n^(x,\nx) ~ l ^inx as 
x — > oo and therefore the asymptotic is no longer given by the function p. In other words a 
phase transition occurs in the asymptotic behavior of ^>(x,y). For a survey on the theoretical 
and computational aspects of smooth numbers see [8]. 

It is worth to mention that in many limit theorems in Number Theory there appear limit- 
ing densities which are constants on some interval starting at 0. An example can be found in 
the work of Elkies and McMullen [3] on the distribution of the gaps in the sequence { y/n mod 1}. 

Here is another example from Probability Theory where the Dickman-De Bruijn distribution 
appears. Let {rjj}j>i be a sequence of independent random variables such that 

P{Vk = k ) = \ and P iVk = 0} = 1 - p 



and let 6 n = X^Li Vj then 



X 

lim P{n' l e n <x} = e" 7 / p(t)dt. 







Theorem 11.11 has several important corollaries and applications. An immediate consequence 
of (ID is that 



s 



U m {n<p s m }= n ™(x) — >e^Jp(t)dt 







as m — > oo. For instance, for s = 2 we get e~ 7 (3 — In 4) w 0.90603. In other words, despite 
the fact that the largest element of our ensemble Q m is of order m m , approximately 90% of the 
"mass" of our probability distribution U m is concentrated on numbers less than p 2 m for large m. 
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Let us fix < a < 1 and decompose the interval (0, a) onto K equal intervals (6k,6k+i] 
6k = jf, k = 0, . . . , K — 1. For fixed K, Theorem O states that 



111 71 6 ^ (J 

n m { 6 k < < 4+1 > — > —rr- (8) 

\np m I iv 



as m — > oo. Let us consider the error term in ([HD 

Inn „ 1 e _7 cr 



{\T\ 71 
6 k < < 4+i 
m p m J 



A" 



In the rest of this paper we provide some estimates about the error terms Em\k,K) when K 
grows with n. We prove the following 

Theorem 1.2. For every e > and every function K{m) such that lim-m^oo j^^p = c > i/iere 
exists m* = m*(e, K) such that the inequalities 

3 7 K(m)-1 3 

-i^- £ -k S ^'(^w)^^ (9) 

/ioW /or every m > m* and every < o~ < 1 . 

An important tool in the proof of Theorem 11.21 is given by the counting function 

M m {t) = #{n < t : n E fi m } . 
This is analogous to the classical quantity 

M(t)=#{n<t: ^)/0}, 

for which the asymptotic 

|im MW = 1 = 6 

holds (see, e.g., [12]). Even though the ensemble fl m is very sparse, its initial segment of length 
p m contains all square- free numbers less or equal than p m . In particular linim^oo Mm< f m "> = — L- 
for every < a < 1. For a = 1 this fact can be rephrased as 



1 V - 2l 



lim — > u (n) 

™^p m f PU C 2 

n<p m 



and can be compared with 



lim ^ v ' = e 7 , 

m^oo In p m ^— ' n 

n<p m 



which is a corollary of our Theorem 11.11 

The following Lemma provides some simple estimates that will be used in the proof of Theorem 

o 
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Lemma 1.3. 

Q ^ YV*+i ° Prn - 1 < ^Pm^Pm <j{P m ~ 1) , , 

k=0 Fm 

(T 3 p a m In 2 p m a{p a m - 1) < Sh a p a m -\ 

12K 2 2K ~2^ p m K ln ^ u y ) 

k=0 ym 

Proof. The right (resp. left) Riemann sum Ylk=o P™ +1 Jc ( res P- 2~2k=o Pmjf) converges as K — > oo 
to the integral e <51np " 1 d5 = Moreover, since the function t i— )■ is increasing, the right 

(resp. left) sum is strictly bigger (resp. smaller) than the integral. This proves the first inequality 
in ( ffOj) and the second inequality in ( FlTT) . A classical result from Calculus states that in the 
absolute value of the error performed by approximating the integral J b f(x)dx by the trapezoidal 
Riemann sum 

\f{xo) + /(xi) + f(x 2 ) + ... + /(z#_i) + ^/(z*) ) Ax, 

Xfc = a + k^j^- is bounded by M< ^2 where sup a<x<fe \f"(x)\ < M. This implies that the error for 
the right Riemann sum 

(f(x 1 ) + ... + f(x K ))Ax 

is bounded from above by + (/(&) — ,/( a ))ior an d gives the second inequality of ( ITU]) when 

applied to the function t*-} p l m over the interval [0, a}. On the other hand, the error given by the 
left Riemann sum 

(f(x ) + ... + f(x K ^))Ax 
is bounded from below by — M ^ c °2 (/(&) ~ /( a ))w an d this gives the first inequality in 

IB. □ 



Proof of Theorem II. £1 



M m (p° m ) Z m ^ Z 



^ n?r m (n) = n?r m (n) < 



Pm Pm _ 

n £ S2 m ft— U n € S2 m 

n ^ pi'<n< pl fe+1 

Pm k=0 neO m Pm k=0 Pm ft=0 

P™ fc < n < pl fc+1 

Applying Lemma [1.31 to the right Riemann sum ^2f=Q Pm +1 ^ we obtain the estimate 



M m (p m ) / e~^ m p^-l e"^Z m fa 3 ln"p m p^-lcrlnp m \ , Z 



< 



P m lnp m lnp m V 12if 2 p m / p m 
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which is true for every m and K. Since, as m — > oo, ' m p f m — > ^2), — >■ ^y, and by 
hypothesis — >■ c (and thus — > 0), then for every e > the inequality 

A'(m)-1 3 



_ / a In p m 

holds true for sufficiently large m. By noticing that })m +1 = p„ ( 1 + (e K ( m > — 1)) and < 

a In P777. 

(e K ( m > — 1) — 7- as m — > 00, we obtain the first inequality of fl9]). On the other hand 

Vm Pm k=Q n£n m Pm k=0 Pm k=0 

and applying Lemma 11.31 to the left Riemann sum Y2k=o Pm we obtain the estimate 
M m(Pm) ^ e^Z m p a m - 1 e _7 Z m /V 3 In 3 p m t p a m - I a\ap m \ t Z m _ h 

k=0 

which is true for every m and K. Proceeding as above we have that for every e > the inequality 



( a 3 \np m p° m -la\np m \ Z m ^ s M 
Pm ~ lnp m Vm lnp m V 12^ 2 Pm 2K ) + p° m ^ Pm m{ ' J 



7 KM_1 rrr 3 



?Sj4 ^ (m)) < 



k=0 

holds for sufficiently large m and we have the second inequality of (J9]). □ 
An immediate consequence of Theorem 11.21 is the following 

Corollary 1.4. Consider a function K{m) such that linim^oo = c > 0. Then the sum of 

the error terms coming from with weights p m a+Sk , satisfies the asymptotic estimate 



^ Et\k,K(m)) = . (12) 



fc=o P m I / 1 



as m —7- 00 /or every < a < 1 . 

Notice that implied constant in the O- notation depends explicitly on c and a by (J3j) and ([9]). 

In Pm 

Moreover, as A; ranges from to K[m) — 1, the weights vary from (— > as m — > 00) to e K ( m > 
(—7- 1 as m — > 00). This means that the error terms Em\k, K(m)) corresponding to small values 
of k are allowed to be larger in absolute value. 
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to K ^ m j ) we just replace the weights p m a+5k by either p a or 1 in (Q. This yields, for every e and 



In order to get estimates on the mean value of the error term (for which al weights are equal 
sufficiently large m, 

3 \ K(m)-1 



Z m V 12C(2) 
In particular we get, as m — > oo, 



%H i°U#fcr) ifc>0; 



if c = 0. 



K(m) In 

Let us point out that, even though by ([S]) the error term Em (k, K(m)) tends to zero as m — >■ oo 
for each /c, it is not a priori true that (Em^) tends to zero as well. It follows from our Theorem 
11.21 that this is indeed the case when K( J^[ np remains bounded (i.e. a particular case of c = 0). 
Let us summarize this fact in the following 

Corollary 1.5. Let < a < 1 and consider a function K(m) such that linim^oo KI J^[ np < oo. 
Then, as m — > oo, 

- • (13) 

In other words, if K grows sufficiently fast (namely as const ■ or faster), then the mean 

value of the error {E&) tends to zero as m — > oo and the rate of convergence to zero is controlled 
explicitly in terms of a and K. 

Notice that one would expect the error term Em\k, K(m)) in (jSJ) to be °(^ K ^ m ^ j, however we 

could only derive the weaker asymptotic estimates ( fl2l and ( fl3|) from Theorem 11.21 A possible 
approach to further investigate the size of the error term in (jSj) would be to first prove an analogue 
of Theorem 11.11 for shrinking intervals. This is, however, beyond the aim of this paper. 



Appendix 

This Appendix contains the estimates for the error terms in the proof Theorem 11.11 By Cj, 
j = 1, . . . , 21, we will denote some positive constants. 

The first term of fl5]) tends to zero as m — > oo uniformly in A. In fact using fl2]) we obtain 

N( Pm ) + ^ + 2 ^ + X ) cos 



mUPm + l)N(p m ) = In 



(Pm + 2; 



2 



N{ "-> An (l + (±-\\- In ( 1 + o(±)\) = o(^)o(±-)=o( 1 



Pm) J V \PmJ J J V^PmJ \Pm J \ ln Pm 
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J 



and 



^sf m {Pm + l)N(p m ) = N(p m ) arctan 



sin 



\^ ln(p m +l) \ 



o 



Pi, 



\np r 



O 



p m + 1 + cos 



as m — > oo, and the implied constants do not depend on A. An explicit computation shows that 

W m )"(s) = ti\s) + f!n ) (s) + f^(s) 1 



where 



f£\s) 



2s + (1 + s 2 ) cos ( 



A In a 
In p m 



2 -° S fe) +1 ) ^ 



s s 



3s + 2s cos 



- l) sin f 



s ( s2 + 2scos (SS) + 1 ) lnp - 

S - 1 + (s 2 - 2s - 1) COS ( 



2 cos 



A In ; 
lnp„ 



- 1 ) ( S 3 - S 2 



A In s 
lnp m 



We have 



|/£>(a)| < 



(1 + s) 2 (s 2 + 2scos(Ag 
dA 2 



and thus 



s 3 In 2 p m ' 



|/i 2) (^)|<^ 

1 1 s d lnp„ 



t=l 

Pm 



< 



+2u; 



C 3 A 2 - 

^ 2 ln£ 



t=i 



< 



In p r , 



11 t=2 
Pot 



and 



as m —j- oo. 



The third function satisfies the estimate 



\f£\s)\ < 



2 cos 



A In; 
In p re 



+ S 2 C 5 C 6 ( 1 - COS 
— < 



A In a 
In p m 



(l + s) 2 (l-s) 4 S J 

We now perform the same change of variables v = v(t) = as before (using r 3 and as in 
the proof of Theorem 11.11) . We get 



Pm 



i=2 



Pm 1 — COS 



Alnj 
lnpm 



<C 8 Wd« 

11 \ 



73 



(t + r^ln^ 
11 



t 2 lnt 

1 — cos(Au) 
tv 



< 



as m — > oo. Another explicit computation shows that 

m rn ns) = f^( S ) + ft\s) + f£\s) 



where 



fS\s) 



fS\s) 



We have the estimates 



;2_l) sin (™£±) 
I \logp m J 



s (s 2 + 2s cos ^ 



A log s 

log p m 



2 i 

1 ) In 2 Pm 



1 + 5s 2 + 2s 2 cos 2 ^ 

gPm 



+ (3s 3 + 5s) cos ( 



A log s 
log p m 



s 2 (s 2 + 2scos(A^)+l) \n Pn 



2 ( s + cos 



A log s 
log p m 



sm 



A log s 

log p,„ 



|£ 4) (*)| < 



CinA 2 



s 3 In 2 



f£\s)\ < 



Pr, 



C n |A| 
s 3 In p„ 



and thus 



f=i 



< 



< 



Pm j 

E +217 



C 12 A 2 5 



- 1 t=2 

Pn 



In p n 

Ci 3 \X\ <p 1 



and 



as m — > oo. 



The estimate 



|/i 6) (*)| < 



C u s sin ("°&*) dssinf^Sf 

ylogp m y 1J \logPm 



(s-iy 



yields, as m — > oo, 

Pm 



t=i 



< 



Pm sin ( £M J / 

t=2 t> V 



73 



(t + T3) 2 lnp m y t 



sin(At>) 



-> 0. 



This concludes the analysis of the error terms coming from (JHJ) • 

Let us now deal with the error terms coming from One sum (giving the main term) is 
already discussed in the proof of Theorem 11.11 Amongst the remaining eleven sums coming from 
([6]), it is enough to check that the following three tend to zero as m — > oo (the other eight being 
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dominated by these): 



Pm 

E 

t=2 



1 (2t-l)(e T ^-l 



tint 



(t + 1) 2 

El 6 -eA In 
g Inpn 
t In 1 i + 1 



t=2 



In m 

Pm 

/ j j. 1 i ^A Int 

t +2 * + e ln *"™ 



+ 

g In p m — \ 



Pm 



( dw 



i \ In t 
g In Pm — ]_ 



Lot 

lnpm 



CirIAI ^ 1 



_L < ^18| 

t 2 \ap m ~ lnp m ^ t 2 



< 



i=2 



g Inpn 



(t + r') 2 ln^ 

2 



E 

t=2 

>0, 



Lot 
lnp™ 



hit , C 21 A 2 y^lni 



0, 



0. 
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